Reinforcement learning

Results: 1147



# Item
281 Behaviorism / Reinforcement / Learning / Applied behavior analysis / Positive behavior support

Functional Assessment and Intervention Design © SG Friedman, 1. Observe and operationally define the target behavior.

Source URL: www.behaviorworks.org

Language: English - Date: 2010-06-04 12:18:30
282 Game artificial intelligence / Search algorithms / Minimax / Alphabeta pruning / Principal variation search / Artificial neural network / Variation / Reinforcement learning / Pruning / Game tree / TD-Gammon / Tree traversal

Reinforcement Learning Techniques in ‘Jumper’ MITCHELL BRUNTON and SHAANAN N. COHNEY University of Melbourne General Terms: Machine Learning, Jumper

Source URL: cohney.info

Language: English - Date: 2016-01-24 20:49:37
283 Markov models / Markov processes / Stochastic optimization / Mathematical optimization / Operations research / Reinforcement learning / Markov decision process / Algorithm / Multi-armed bandit / Dynamic programming / Shortest path problem / PP

Deterministic MDPs with Adversarial Rewards and Bandit Feedback Raman Arora TTIC 6045 S. Kenwood Ave. Chicago, IL 60637, USA

Source URL: dept.stat.lsa.umich.edu

Language: English - Date: 2012-09-12 18:50:24
284 Mathematics / Mathematical analysis / Artificial intelligence / Backgammon / Rollout / Markov decision process / Multi-armed bandit / Reinforcement learning / Inverted pendulum / Pendulum / Prime-counting function / Valuation

Rollout Allocation Strategies for Classification-based Policy Iteration Victor Gabillon Alessandro Lazaric

Source URL: victorgabillon.nfshost.com

Language: English - Date: 2010-07-01 09:47:14
285

Deep Reinforcement Learning for Flappy Bird Kevin Chen Stanford University Abstract Reinforcement learning is essential for training an agent to

Source URL: cs229.stanford.edu

Language: English - Date: 2015-12-14 19:46:07
286

DAGGER and Friends References: 1. A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning, Ross, Gordon & Bagnell DAGGER algorithm 2. Reinforcement and Imitation Learning via Inte

Source URL: rll.berkeley.edu

Language: English - Date: 2015-10-15 00:39:05
287

Fear the REAPER: A System for Automatic Multi-Document Summarization with Reinforcement Learning Cody Rioux Sadid A. Hasan Yllias Chali University of Lethbridge Philips Research North America University of Lethbridge

Source URL: emnlp2014.org

Language: English - Date: 2014-10-16 05:19:55
288 Multi-agent systems / Artificial intelligence / Academia / Systems science / Simulation / Belief revision / Agent-based model / Autonomous Agents and Multi-Agent Systems / Reinforcement learning

Policy Communication for Coordination with Unknown Teammates Trevor Sarratt and Arnav Jhala University of California Santa Cruz {tsarratt, jhala}@soe.ucsc.edu Abstract

Source URL: mipc.inf.ed.ac.uk

Language: English - Date: 2015-12-23 07:01:08
289 Algebra / Linear algebra / Mathematics / Markov models / Markov processes / Matrix theory / Matrices / Q-learning / Markov chain / Matrix / Reinforcement learning / Temporal difference learning

An Emphatic Approach to the Problem of Off-policy Temporal-Difference Learning arXiv:1503.04269v1 [cs.LG] 14 Mar Richard S. Sutton

Source URL: arxiv.org

Language: English - Date: 2015-03-16 20:16:49
290

Scaling up Inverse Reinforcement Learning through Instructed Feature Construction Tomas Singliar Dragos D. Margineantu Boeing Research & Technology P.O. Box 3707, M/C 7L-44

Source URL: snowbird.djvuzone.org

Language: English - Date: 2011-02-10 16:50:25